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Abstract 

A probability transformation of a r.v. X with EA^ > and the characteristic function 
(ch.f.) fit) = Ee^''^ is introduced. It is defined as arbitrary r.v. with the ch.f. f*{t) = 
-/"(t)/EA2. For any r.v. A with EA = 0, EA^ > and E|A|3 < oo the precise 
estimate Li(A*,A) ^ E|A|^/EA^ is proved, where A* is any r.v. with the ch.f. f*{t), 
Li(A*, A) is the Li-distance between A and A*. As a corollary, some new estimates for 
the proximity of the ch.f. f(t) to the normal one are proved involving the double integrals 
of the corresponding transformation. 
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1 Introduction 

Let X be a random variable (r.v.) with the distribution function (d.f.) F{x) = P{X < x), 
X G M, and the characteristic function (ch.f.) f{t) = Ee**"^, t G M. As is well known, if X is 
nonnegative with < EA < oo, then 

m 

/'(O)' 

is a ch.f., and if < EA^ < oo, then 



t G 



fit) - f'{o) nt) 
tno) ' no) 

are ch.f.'s as well (see, e.g., |13[ Theorem 12.2.5]). The probability transformation given by ([1]) is 
called the X-size bias transformation. By a transformation of a random variable we mean that 
of its distribution. The A-size bias transformation was introduced by Goldstein and Rinott [5] 
for the purpose of estimation of the accuracy of the multivariate normal approximation to 
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nonnegative random vectors under conditions of local dependence by Stein's method. Namely, 
in [5] an almost surely nonnegative r.v. X^'^^ with < EX < oo is said to have the X-size 
biased distribution if 

dP{X^'^ <x) = -^ dF{x), xeR. (2) 
EX 

It is easy to see that the distribution given by ([2]) has the ch.f. given by ([T]), hence, by virtue 
of the uniqueness theorem, definitions ([T]) and ([2]) are equivalent. In the same paper Goldstein 
and Rinott also noticed that the distribution of X'^''^ may be characterized by the relation 

EXG(X) = EXEG(X(")), 

which should hold for all functions G such that EXG(X) < oo. 

As regards the second transformation, if EX = 0, then the distribution given by the ch.f. 

fit) - no) 1 m 

t/"(0) a2 t ' 

where = EX^ > 0, is called the X-zero biased distribution. This definition was introduced 
by Goldstein and Reinert in [3] in an equivalent form for the purpose of generalization of the 
size bias transformation to r.v.'s taking both positive and negative values and was inspired 
by the characteristic property of the mean zero normal distribution as the unique fixed point 
of the zero bias transformation. Namely, in [3] a r.v. X^^^ is said to have the X-zero biased 
distribution if 

EXG'(X) = (T^EG"(X(^)) 

for all absolutely continuous functions G for which EXG'(X) exists. The zero biased transfor- 
mation possesses the following elementary properties (most of them are noticed/proved in [3]): 

1. The zero biased distribution is absolutely continuous and unimodal about zero with the 
probability density function 

p{x) = a-^EXI{X > x), X e M, 

and the ch.f. 

Ee^*^^^' ^ ■ tGM. 
t 

2. X*^^) = X if and only if X has the normal distribution with zero mean [T71 [3]. 

3. The zero biased transformation preserves symmetry. 

4. (t2E(X(^))" = EX"+V(n + 1) for n e N, in particular, cr^EX^^) = O.SEX^. 

5. If X = Yi + . . . K„, where Yl, . . . , y„ are independent r.v.'s with zero means and EY^' = 
cr| > so that (7^ + . . . + (T,^ = (T^, then X^^^ = Xj + Yj'^\ where / is a random index 
independent of Yi, . . . ,Yn with the distribution P(/ = i) = af/a"^, i = l,...,n, and 
Xi = X - Yi = Y.jjLi Yj- 
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6. The following non-trivial fact was proved in 2009 independently by Goldstein [2] and 
Tyurin [JSIDE]: 

(3) 

Li{X, Y) being the Li-distance between the r.v.'s X and Y, 

Li{X,Y) = inf |E|X'-r'|: X' = X, F' = F j , E|X| < oo, E\Y\ < oo. 

As regards the third transformation, we give here the following 

Definition 1. Let X be a r.v. with the ch.f. f{t) and = EX^ e (0, oo). The distribution 
given by the ch.f. 

f"(f) 

rit) ^ t G M, (4) 

is called the X-shape biased distribution. 
It is easy to see that 

1. A r.v. X* has the X-shape biased distribution if and only if its d.f. F*{x) satisfies 

dF*{x) = — dF{x), xeR. (5) 

2. A r.v. X* has the X-shape biased distribution if and only if for all functions G for which 
EX^G{X) exists, 

EX^G{X) = a^EG{X*). (6) 

3. X* = X if and only if P(|X| = cr) = 1, i.e. any Bernoulli distribution with symmetric 
atoms is a fixed point of the shape bias transformation. This can be verified by noticing 
that the solution of the corresponding linear homogeneous differential equation f"(t) + 
cr'^fit) = of the second order with the initial condition /(O) = 1 has the form f{t) = 
p^tcrt _|_ _ p^g-^o"*^ p G M, being a ch.f. if and only if p G [0, 1]. 

4. (X*)2 = (X2)(*), where (X2)(^) has the X^ -size biased distribution. 

5. (cX)* = cX* for any constant c G M. 

6. (t2E(X*)" = EX"+2, for n G N, in particular, a^EX* = EX^. 

Moreover, the following estimate for the proximity between X and X* in terms of Li-metric 
will be proved in this paper. 

Theorem 1. // EX = 0, EX^ = 1 and E|Xp < oo, then 

Li(X,X*) ^ E|X|^ 

moreover, for any e > there exists a distribution of a r.v. X concentrated in two points, such 
that EX = 0, EX2 = 1, E\X\^ < oo, and 

Li(X,X*) > {l-e)E\X\\ 
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The existence of the shape bias transformation follows from the earlier result of [13] men- 
tioned above. Moreover, in 2005, Goldstein and Reinert j3] proved the existence of a class of 
transformations of probability distributions that is characterized by equations like Namely, 
the authors described a class of measurable functions T : R — )■ M that provide the existence and 
uniqueness of the distribution of a random variable X*^^^ such that 



ET(X)G(X) = EG(™)(X(^)) ■ ^^"^^^^ 

ml 



for all measurable m times differentiable functions G: M — t- M with E\T{X)G{X)\ < oo. The 
authors also noticed that this class includes the zero- and size- bias transformations respectively 
with m = 1, T{x) = x and m = 0, T{x) = x^ . It is easy to see that this class also includes the 
shape bias transformation (with m = 0, T{x) = x^). 

However, the properties of the shape bias transformation are formulated and proved here 
for the first time. 

2 Motivation and applications 

The zero bias transformation gives an opportunity to construct an integral estimate for the 
proximity of a ch.f. with zero mean to the normal one with the same variance in terms of the 
proximity of the corresponding zero biased distribution to the original one, which might be 
sharper than non-integral estimates based on the Taylor formula in the neighborhood of zero. 
Namely, for the sake of convenience put o"^ = 1 implying /^a ^ 1 by the Lyapounov inequality. 
Then using the elementary relations 

t ft 

2, 



^0 



f [Ee'^'' -£e'^''''')ue^"'^du, (7) 
Jo 



ft 



and the estimate for the difference of arbitrary ch.f.'s with the finite first moments due to 
Korolev and Shevtsova [8] 

|Ee^*^ - Ee^*^| ^ 2sin (^^Li(X,r) A E|X|, E|r| < oo, t € M, (8) 

a A 6 = min{a, fe}, a, 6 G M, it is not difficult to conclude that 



^ e 



-t^/2 



Ee™^ - Ee^"^*^' 



ue 



^2e-'"/^ f sin(^Li(X,X(^)) A^)Me"'/2rfM, t G M. 
Finally, applying inequality ([3]) to estimate Li(X, X*^^)) one obtain 

r{t) ^ 2e-*'/2 sin A ue^'^'du, t G M, (9) 
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for any r.v. X with EX = 0, EX^ = 1, E|X|^ = /^a < oo. Estimate (Q is exact as t — )■ 0, since 
it implies that for alH ^ and /Ss ^ 1 such that P^t ^ it/2 we have 

r(t) ^ 2 usmi^—jdu = ^^i^sm- ^ cos — j ~ t ^ 0, 

with the least possible factor 1/6, and is always sharper than the power-type estimate r{t) ^ 
Ps\t\^/6 especially for moderate (separated from zero) values of /33|t|. Note that Ps\t\ can be 
separated from zero for large enough values of /^s ^ 1 even if \t\ is small. Thus, the estimates 
for r{t) of an integral ©-type form play an important role in the construction of the least 
possible upper moment-type bounds of the accuracy of the normal approximation which should 
be uniform in some classes of distributions, especially if in these classes extremal distributions 
have large third absolute moments. This situation is typical, for example, for the problem of 
optimization of the absolute constants in the Berry-Esseen-type inequalities with an improved 
structure (see P, [101 El US US] where a smoothing inequality is applied with the subsequent 
estimation of the difference |/„(t) — e~* /^|, /„(t) being the ch.f. of the normalized sum of 
independent random variables, in terms of the difference \f{t) — e~* /^|, f{t) being the ch.f. 
of a single r.v.) and in its non-uniform analogues for sums of independent r.v.'s that use the 
Berry-Esseen inequality with an improved structure (see [Il[l5l[6]), as well as in the moment- 
type estimates of the rate of convergence in limit theorems for compound and mixed compound 
Poisson distributions (where /^s — oo, see [TT| [HI [H]) which use the Berry-Esseen inequality 
with an improved structure as well. 

The above reasoning suggests that for the moderate values of /^slt], estimates for r{t) in 
the twice-integrated form might be even sharper than estimates in the once-integrated form 
like ([9]). Since the ch.f. f{t) is supposed to be differentiable at least twice, it is possible to 
continue as 

fit) - e-'^' = e-*V2 Te-V^ T + sf{s)ydsdu = 

Jo Jo 

= e-'"/' f e^"'^ r ins) + fis) + sns))dsdu = 
Jo Jo 

or as 

/(t) _ e-*V2 = e-tV2 /■* r (/(s)e^'/2 - l)"dsdu = 

Jo Jo 

= e^''" £ (/"(s) + f{s) + sfis) + s{ns) + sf{s))y'/Hsdu = 

= (Ee^'"" - Ee*'""* + ^/'(^) + ^'(Ee*^^ - Ee''''''')^ '^ds du. 

Note that the second estimate contains the additional term s^iEe^^-^ — Ee*'^"^*"') , but the factor 
e^^/^ does not exceed the analogous factor e^^^'^ in the first one. However, for all ^ s ^ t this 
additional term satisfies 

g^{s) = s'^lEe''^ - Ee''^^^^\ ^ 2s^ <: 2^ = 0{f), t 0, 
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(actually, an even sharper estimate can be obtained, if inequalities ([8]) and ([3]) are used). If 
EX = 0, EX^ = 1, /33 = E\Xf < oo, then \f'{s) + s\ ^ and thus for all ^ s ^ t and 

bounded fi^t we have 

g,{s) = s\f'{s)\ ^s' + ^^t' + ^ = 0{t'), t ^ 0. 

So, 

sup ((72(s) + (73(s)) = 0(^2), 

while the first term 

^?i(.) = |Ee-^-Ee-^*| = |/(s) + /"(.)! 

should be equivalent to P^s as s ^ 0+ in order that the final integrated estimate should have 
the exact order I3^\t\^/Q as t — 0. Thus, it is gi{s) that determines the behavior of the final 
integral estimate for small values of s, and the problem of construction of the least possible 
bound for gi{s) is very important. Theorem [T] gives an opportunity to construct such a bound. 
Namely, the following corollaries hold. 

Corollary 1. LetX he a r.v. with the ch.f. f{t) and EX = 0, EX"^ = 1, (3^ = E\X\^ < oo. 
Then for allteR 

|/(t) + r(t)|^2sin (^A^). 

Corollary 2. Let X be a r.v. with the ch.f. f{t) and EX = 0, EX^ = I, (33 = E\Xf < oo. 
Then for allteR 

Note that, as t — 0+, the r.-h. sides of the inequalities presented in corollary [2] are equivalent 

to 

sm )asau = — r sm — < 

V 2 / (31 \ 2 2 J 



JO 



32/ (33t (33t (33t\ r . f(3su\ 

< sm — — cos — — 1=2 M sm — — du, 

(3i\ 4 4 4 J Jo V 4 7 ' 



provided that ^ (3^1 ^ vr. 

So, corollary [2] plays an important role in estimation of the rate of convergence in limit 
theorems for sums of independent random variables mentioned above. However, particular ap- 
plication of corollary [2] is the subject of a separate investigation and will be published elsewhere. 
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3 Proof of theorem I] 



The proof consists of three parts: first, we reduce the problem to the class of discrete distribu- 
tions concentrated in a finite number of points; second, we will demonstrate that the extremal 
distribution has at most 3 atoms; and finally, we consider two- and tree-point distributions. 

1. Reduction to the class of distributions concentrated in a finite number of 
points. Let X be a r.v. defined on some probability space {Q,A, P). Assume that E|Xp < oo. 
On the same probability space, construct a sequence of r.v.'s {Xn}n^i such that: 

1) for each n, Xn takes at most a finite number of values, 

2) EXrr = 0, EX^ = 1, 

3) Li(X„, -> Li(X,X*) and E\X„\^ E\X\^ as n ^ cx). 
Denote the indicator function of a set A by I{A). For n = 1, 2, ... let 

k\ ^ k^k k + 1 

^2" V 2*^ 2" / ^ 2*^ V 2" 2" 

fc=l ^ k=-n2^ ^ 

Then, evidently, Yn takes no more than n2'^~^^ values, \Yn\ ^ n. Moreover, Yn ^ X a.s., 
\Yn\ ^ |X|, E|Xp < oo, hence, by virtue of the Lebesgue dominated convergence theorem, 
E|y„p E|X|^, EY^ EX^ = 1, EYn EX = 0, and, consequently, VarF„ 1 as n -> oo. 
So, there exists a number riQ € N such that for all n ^ no we have \EYn\ ^ 1 and VarF^ ^1/4. 
Now let 

Yn - EYn 
Xn = , 77, = 1, 2, . . . 

Then X„ takes at most a finite n2"+^) number of values, EX„ = 0, EX^ = 1, X„ — )■ X a.s. 
and for all n ^ no 

|X„K^^±p^^2(|X| + l)^ry, Etj'koo, 

hence E|X„ — X|^ — )■ and E|X„p — )■ E|Xp as n ^ oo. So, it remains to show that 
Li(X„,,(X„)*) ^Li(X,X*). 

Using the triangle inequality we have 

Li(X,X*) ^Li(X,X„) + Li(X„,(XO*) + Li((X„)*,X*), 

Li(X„, (X„)*) ^ Li(X,,X) + Li(X,X*) + Li(X*, (X„)*), 
which is equivalent to 

|Li(X„, (X„)*) - Li(X,X*)| ^ Li(X,X„) + Li(X*, (X„)*). (10) 

The first term in (1101) tends to zero since 



Li(X,X„) ^ E|X-X„| ^ (E|X-X„|=^)'/'^0, n-^ 



oo. 



Consider the second term Li(X*, (X„)*). As is well known, the Li-metric can be represented 
in terms of the Ci-metric as 

Li(X,y) = Ci{X,Y) = sup{|E/(X) - E/(F)|: / G J^^}, E\X\ < oo, E|F| < oo. 
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where T\ is the set of all real-valued functions on M such that sup^_^j^ 1/(2^) ~ f{y)\l\'^ ~ Z/l ^ 1- 
Therefore, using the elementary property of the shape-biased d.f. dF*[x) = x^dF(x), x G M, 
we conclude that 

L,{X*, (X^Y) = Ci(X*, = sup \EX* - E(XO*| = sup \EX^f{X) - EXlf{X^)\. 

Fix any function / G -Fi. Without loss of generality it can be assumed that /(O) = 0, since 
EXl = EX^ = 1 and 

EX'fiX) - EXlfiX^) = EX'ifiX) - /(O)) - EX^(/(X„) - /(O)). 

Then for this function / we have \f{x) — f{y)\ ^ |x — ?/|, x, y G M, in particular, if y = 0, then 
^ and hence 

\x^f{x) -y^fiy)\ ^ x^lfjx) - f{y)\ ^ _ - y'^\ ^ 
\x — y\ ^ \x — y\ \x — y\ ^ 

^ + \y\i\x\ + \y\) =x^ + y^ + \xy\ ^ + \y\y, x^y. 

So, by the Holder inequality we obtain 

Ci(X*, ^ sup {E\X'fiX) - X2/(X„)|I(X„ ^ X) : / G J-i, /(O) = O} ^ 

^ E\X - X„|I(X„ ^ X)(|X„| + \X\f ^ E\X - + < 

^ (E|X - X„nV3(E(|X„| + |X|)3)2/3 ^0, n ^ oo, 

since lim E|X — X„p — )■ and E(|X„| -|- |X|)^ < oo (the expectations E|X„|^, E\Xf being 

n—^oo 

finite). 

Thus, the second term Li{X*, (X„)*) in (ITUI) also tends to zero as n — )■ oo. The sequence 
Xn satisfies all the conditions l)-3) and in further reasoning, to prove the theorem it suffices 
to consider only discrete distributions concentrated in a finite number of points. 

2. Reduction to the class of distributions concentrated in at most three points. 

Since for any function / G -Fi we also have (— /) G and E/(X*) = EX^/(X), 

Li(X,X*) = Ci(X,X*) = sup(E/(X) - E/(X*)) = sup E(l -X2)/(X). 
For / G -Fi denote 

J(X,/) = E((l-X2)/(X)-|Xn. 

Then 

Li(X,X*)-E|X|3= sup J(X,/), 

and the statement of the theorem is equivalent to sup J(X, /) = 0, where the supremum is 
taken over all / G -Fi and all r.v.'s X taking a finite number of values and satisfying the 
conditions EX = 0, EX^ = 1, E|X|^ < oo. As it follows from the results of W. Hoeffding [7], 
the supremum of a linear (with respect to the d.f. F{x)) functional J(X, /) over all distribution 
functions F{x) of a r.v. X concentrated in a finite number of points and satisfying two linear 
(with respect to the d.f. F{x)) equality-type conditions EX = 0, EX^ = 1 is attained at the 
distributions concentrated in at most three points. 
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3. Checking two- and three-point distributions. Here we will use one more represen- 
tation of the Li-metric in terms of the mean-metric: 



Li{X,Y) = >£{X,Y) = \P{X <u)-P{Y <u)\du, E|X| < oo, E|r| < oo. 

J — oo 

In particular, with Y = X* and P(X* < u) = EXH{X < u),we have 

/oo 
\F{u)-EXH{X <u)\du. 
'OO 

Let a r.v. X take two values and satisfy the conditions EX = 0, EX^ = 1. Then its 
distribution should necessarily have the form 



P (x = =p=i-p(x = - , g = 1 - p G (0, 1) 

It is easy to see that EX^ = (g — 'P)l\fV(i^ E|X|^ = (p^ + q^)/ ^/pq. Then 



0, M ^ -\fvh^ 

1, V(l/P<u, 

EXH{X <u)-F{u) = {p-q)l{-y^<u^ y^), 



and hence 



Li(X,X*)= f \p-q\l(-^/^<u^J^)du=^-^^-^ = \EX^\^E\Xf 
by virtue of the Jensen inequality, thus, the statement of the theorem holds. Moreover, for any 

E|X|3 l-2p + 2p^ ^ 

for aU < p < e/2. 

Now consider a r.v. X taking exactly three values. Note that 

Li(X,X*) ^ ^ . ^ rLi(X,X*)a2 



sup 



{^^i^^:EX = 0, EX^ = l}=sup{^lil^^:a>0, EX = 0, EX^ = .^}, 



where the supremums are taken over three-point distributions of a r.v. X. Let X take the 
three values, x, y, z with the probabilities p,q,r > respectively, p + q + r = 1. Without loss 
of generality one can assume that x < y ^ < z. From the conditions EX = 0, EX^ = we 
find that 

(j^ + yz cr^ + xz + xy o 

p = - -, q = —, 77 r, r = - -, -yz < a < -xz. 

{z-x){y-x) [z-y)[y-x) [z - x)[z - y) 



For all n G M we have 




u ^ X, 

EX%X <u)= I ^-^2' 2 ^ ^ 5 ^' 
1 pj.^ _|_ qy^^ y < u ^ z, 

cr^, z < u. 



a^^EXH{X <u)- F{u) 



0, M ^ X, 

•p{x^ ja^ ~ 1)) X < u ^y, 

{px^ + qy"^) / a"^ — p — q, y < u ^ z, 
0, z < u. 

Noticing that (px^ + qy'^)/cr'^ — P — Q = {'^'^ — rz'^)/a'^ — 1 + r = r(l — z'^/a'^), we obtain 



/oo 
- (t-2EX2i(X <u)\du = p 
'OO 



X 



{y — x) + r 



(z-y). 



Consider the function 



Li{X,X*)a' - E\X\^ = p{y - x)\x' - a'\ + r{z - y)\z 



2 a'\ 



px^ + qy"^ — rz^ 



z — X 



\x 



a'\ (a' + yz) 



\z — a \ [a 



2z^(a'^ + xz) 

xy) ^ '-+ 

z-y 



[z^ — x^ — xy + yz) + xyz{z — x) + 2xz^^ = g{x, y, z, o"^). 



The statement of the theorem is equivalent to snp g{x,y, z,a'^) = 0, where the sumpremum 
is taken over all cr^ > 0, x < y ^ < 2; such that —yz < < —xz. Note that it suffices 
to consider only < max{x^,2;^}, since the opposite inequality (with g > 0) implies that 
EX^ < 0"^. So, there are only three possibilities: 1) < cr^ < min{x^,2;^}, 2) x^ ^ < z^, 
3) 2;^ ^ 0"^ < x^. Opening the modules, we notice that g{x, y, z, cr^) is a parabola with respect 
to (T^ on each of the intervals specified above. Consider the behavior of g{x, y, z, cr^) on each of 
these intervals. 

1. < (T^ < min{x^,2^}, then necessarily cr^ < —xz and 

2{a^ + xy){yz^ + a\z-y)) 



g{x,y,z,a ) 



{z - y){z - x) 



The coefficient —2/{z — x) at o"'^ is negative, thus the branches of this parabola with 
respect to cx^ look down and the maximum value of the function g{z, y, z, cr^) within the 
interval < o"^ < min{x^, 2^} is attained either at the vertex 



yi^z"^ + xz — xy) 



if cr^ > —yz, or at the point cr^ — )■ —yz + 0, if cr^ ^ —yz. We have 

ai+yz=y^'^'-^yy^^'-^^Uo, 

2{z-y) 

since x < y ^ < z, with the equality attained if and only if y = 0. Thus, the supremum 
is attained as cr^ — —yz + 0, which implies that p — )■ and reduces the problem to 
checking two-point distributions considered above. 



2. 2;^ ^ (T^ < x^, then 



g{x,y,z,a'^) 



2z^{a^ + xy) 
{z - x){z - y) 



-2rz'^ < 



by virtue of the conditions r > 0, 2 > 0. 
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3. ^ < z^, then the function 

—a^{x'^{z — y) + y^{z — x) + xyz) + xyz{xy — xz — yz) 



g{x,y,z,a ) 



{z - x){z - y) 



is hnear and decreases monotonically in o"^, since — y) + y'^{z — x) + xyz > 0. Thus, 
if ^ —yz, then the supremum of g{x, y, z, cr^) is supphed by — t- —yz + 0, which 
reduces the problem to checking two-point distributions considered above. If x"^ > —yz, 
then the supremum of g{x, y, z, o"^) is attained at o"^ = x^. With this value of cr^ we have 

2\ 2x(x + y)(x'^z — yx'^ + yz'^) 

g{x,y,z,x ) = r . 

{z - x){z - y) 

d 2x '^x{x + z){yz{2z-y) + x{z-yf) 

— g{x,y,z,x = < 0, 

ay [z — x){z — yY 

since x < —z and x < y ^ Q < z. Thus, the supremum of g{x, y, z, x^) over all y such that 
X < y ^ and —yz < x^ = cr^ is supplied by y — f max{x, —x'^/z} + = -x^jz + 0, i. e., 
p — > 0, which reduces the problem to checking two-point distributions considered above. 
Thus, the theorem is completely proved. 
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